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Gp41 -engineered loop = SEQ ID NO 7 and NO 8 



M Q472 ARQLLSGIVQQQNNLL 18 
ATG GAG GCC AGA CAA TTA TTG TOT GGT ATA GTG GAG GAG GAG AAC 
AATTTCCTG 



RAI EAQQHLLQLTVWGI K 36 
AGG GCT ATT GAG GCG CAA GAG CAT CTG TTG CAA CTC ACA GTC TGG 
GGCATCAAG 



QLQARI LAVERYLKDQQL 54 
GAG CTC GAG GCA AGA ATG CTG GCT GTG GAA AGA TAG CTA AAG GAT 
CAACAGCTC 



LGIDGSSGGRGGSNASWS 72 
CTG GGG ATT GAG GGT AGC AGT GGA GGT AGA GGT GGA TCC AAT GCT 
AGTTGGAGT 



NKSLEQI WNHTTWMEWDR 90 
AAT AAA TCT CTG GAA GAG ATT TGG AAT CAC AGG ACC TGG ATG GAG 
TGG GAG AGA 



EINNYTSLI HSLI EESQN 108 
GAA ATT AAC AAT TAG ACA AGC TTA ATA CAC TCC TTA ATT GAA GAA 
TCGCAAAAC 



QQEKNEQELLELDLEHHH 126 
GAG CAA GAA AAG AAT GAA CAA GAA TTA TTG GAA TTA GAT CTC GAG 
CAC CAC CAC 



H H H * 129 
CAC CAC CAC TGA 



FIGURE 1 
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2A 

Gp41 (region 545-684): SEQ ID NO 1 

QARQLLSGIVQQQNNLLRAIEAQQHLLQLTVWGIK 
QLQARILAVERYLKDQQLLGIWGCSGKLICTTAVP 
WNASWSNKSLEQIWNNMTWMEWDREINNYTSLIHS 
LffiESQNQQEKNEQELLELDKWASLWNWFNITNWL 



2B 

Gp41 regions 



Region 555-577 


SEQ ID NO 3 


QQQNNLLRAIEAQQHLLQLTVWG 


Region 572-601 


SEQ ID NO 4 


QLTVWGIKQLQARILAVERYLKDQQLLGIW 


Region 590-620 


SEQ ID NO 5 


RYLKDQQLLGIWGCSGKLICTTAVPWNASWS 


Region 628-663 


SEQ ID NO 6 


WNNMTWMEWDREINNYTSLIHSLIEESQNQQEKNEQ 



2C 

Linker oligopeptide: SEQ ID NO 2 

SGGRGGS 
2D 

Region 603-615: SEQ ID NO 15 : C S G K L I C T T A V P W (-13 residues) 
R^55&^ SBQIDNO 16: LGIWGCSGKLICTTAVP WNAS WSNK(-25ttsidues) 

FIGURE 2 
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SEQ ID NO 13 and 14 (GP41 matrix) : 

m Q472 ARQLLSGIVQQQNNLIi 488 
(ATG) CAG GCC AGA CAA TTA TTG TCT GGT ATA GTG CAG GAG CAG AAC AAT TTG CTG 
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SEQIDN0 17 

M Q(472) ARQIiLSGIVQQQNNLIi 18 

RAIEAQQHIiLQIiTVWGIK 36 

QLQARILAVERYIiKDQQL 54 

SGGRGGSSr.EQIWNHTTW 72 

MEWDREINNYTSIilHSLI 90 

EESQNQQKKNKQEIaXiEIiD 108 

KWASLWNWFNITNWLDHH 126 
H H H H 

FIGURE 4A 

SEQIDN018 

M Q(472) ARQ LLSGIVQQQNNLL 18 

RAIEAQQHIiLQIiTVWGIK 36 

QLQARILAVERYIiKDQQL 54 

SGGRGGSSLEQIWNHTTW 72 

MEWDREINNYTSIiIHSIiI 90 

EES QNQQEKNEQELLEIiD 108 

KWAS IiWNWFNI TNDHHHH 126 
H H 

FIGURE 4B 
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SEQrDN0 19 

M Q(472) ARQLLSGIVQQQNNLL 18 

RAIEAQQHLLQLTVWGIK 36 

QLQARILAVERYLKDQQL 54 

LGIWGSSGGRGGSSIiEQI 72 

WNHTTWMEWDREINNYTS 90 

LIHSLIEESQNQQEKNEQ 108 

EliLELDKWASIiWNWFNIT 126 

NWLDHHHHHH 136 

FIGURE 5A 

SEQIDNO20 

M Q(472) ARQLLSGIVQQQNNLr, 18 

RAIEAQQHLLQIiTVWGIK 36 

QLQARIIiAVERYLKDQQL 54 

LGIWGSSGGRGGSSIiEQI 72 

WNHTTWMEWDREINNYTS 90 

LIHSLIEESQNQQEKNEQ 108 

ELLELDKWASLWNWFNIT 126 

NDHHHHHH 134 

FIGURE 5B 
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SEQIDNO 21 

RAIEAQQHLLQLTVWGIK 36 
QLQARlLAVER YLKDOQli 54 
SGGRGGSSLEQIWNHTTW 72 



M E W D 

E E S Q 

K W A S 

H H H H 



REINNYTSIiIHSLI 90 
NQQEKNEQELLELD 108 
LWNWFNITNW LDHH 126 



FIGURE 6 
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SEQ ID NO 9 : 5' GG AAT CCA CAT ATG CAG GCC AGA CAA TTA TTG 3' 

SEQ ID NO 1 0 : 5' ACC GTT GGA TCC ACC TCT ACC TCC ACT GCT ACC GTC 
AAT CCC CAG GAG CTG TTG ATC 3' 

SEQ ID NO 11 : 5' GG AAT CCA GGA TCC AAT GCT AGT TGG AGT AAT AAA 
TCT CTG GAA 3' 

SEQ ID NO 12 : 5' GCC CGG CTC GAG ATC TAA TTC CAA TAA TTC TTG TTC 
ATT CTT TTC 3' 

FIGURE 7 
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